Bayesian speech synthesis framework integrating training and synthesis processes

نویسندگان

Kei Hashimoto

Yoshihiko Nankaku

Keiichi Tokuda

چکیده

This paper proposes a speech synthesis technique integrating training and synthesis processes based on the Bayesian framework. In the Bayesian speech synthesis, all processes are derived from one single predictive distribution which represents the problem of speech synthesis directly. However, it typically assumes that the posterior distribution of model parameters is independent of synthesis data, and this separates the system into training and synthesis parts. This paper removes the approximation and derives an algorithm that the posterior distributions, decision trees and synthesis data are iteratively updated. Experimental results show that the proposed method improves the quality of synthesized speech.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Bayesian approach to Hidden Semi-Markov Model based speech synthesis

This paper proposes a Bayesian approach to hidden semiMarkov model (HSMM) based speech synthesis. Recently, hidden Markov model (HMM) based speech synthesis based on the Bayesian approach was proposed. The Bayesian approach is a statistical technique for estimating reliable predictive distributions by treating model parameters as random variables. In the Bayesian approach, all processes for con...

متن کامل

A mel-cepstral analysis technique restoring high frequency components from low-sampling-rate speech

In statistical speech synthesis, the quality of the synthesized speech depends on the quality of training data. As the sampling rate of speech is one of the effective factors, speech data has been recently recorded at a high sampling rate. However, the sampling rates of speech data recorded in the past or collected from the Internet were often low. Therefore, to use these speech data effectivel...

متن کامل

Overview of NIT HMM - based speech synthesis system for Blizzard Challenge 2010

This paper describes a hidden Markov model (HMM)-based speech synthesis system developed for the Blizzard Challenge 2010. This system employs STRAIGHT vocoding, minimum generation error (MGE) training, minimum generation error linear regression (MGELR) based model adaptation, the Bayesian speech synthesis framework, and the parameter generation algorithm considering global variance. The real-ti...

متن کامل

Multi-Speaker Modeling with Shared Prior Distributions and Model Structures for Bayesian Speech Synthesis

This paper investigates a multi-speaker modeling technique with shared prior distributions and model structures for Bayesian speech synthesis. The quality of synthesized speech is improved by selecting appropriate model structures in HMMbased speech synthesis. Bayesian approach is known to work for such model selection. However, the result is strongly affected by prior distributions of model pa...

متن کامل

Speech Parameter Sequence Modeling with Latent Trajectory Hidden Markov Model

The weakness of hidden Markov models (HMMs) is that they have difficulty in modeling and capturing the local dynamics of feature sequences due to the piecewise stationarity assumption and the conditional independence assumption on feature sequences. Traditionally, in speech recognition systems, this limitation has been circumvented by appending dynamic (delta and delta-delta) components to the ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2010

Bayesian speech synthesis framework integrating training and synthesis processes

نویسندگان

چکیده

منابع مشابه

A Bayesian approach to Hidden Semi-Markov Model based speech synthesis

A mel-cepstral analysis technique restoring high frequency components from low-sampling-rate speech

Overview of NIT HMM - based speech synthesis system for Blizzard Challenge 2010

Multi-Speaker Modeling with Shared Prior Distributions and Model Structures for Bayesian Speech Synthesis

Speech Parameter Sequence Modeling with Latent Trajectory Hidden Markov Model

عنوان ژورنال:

اشتراک گذاری